|
In statistics, dispersion (also called variability, scatter, or spread) denotes how stretched or squeezed〔()〕 a distribution (theoretical or that underlying a statistical sample) is. Common examples of measures of statistical dispersion are the variance, standard deviation and interquartile range. Dispersion is contrasted with location or central tendency, and together they are the most used properties of distributions. ==Measures of statistical dispersion== A measure of statistical dispersion is a nonnegative real number that is zero if all the data are the same and increases as the data become more diverse. Most measures of dispersion have the same units as the quantity being measured. In other words, if the measurements are in metres or seconds, so is the measure of dispersion. Such measures of dispersion include: * Sample standard deviation * Interquartile range (IQR) or interdecile range * Range * Mean absolute difference (also known as Gini mean absolute difference) * Median absolute deviation (MAD) * Average absolute deviation (or simply called average deviation) * Distance standard deviation These are frequently used (together with scale factors) as estimators of scale parameters, in which capacity they are called estimates of scale. Robust measures of scale are those unaffected by a small number of outliers, and include the IQR and MAD. All the above measures of statistical dispersion have the useful property that they are location-invariant, as well as linear in scale. So if a random variable ''X'' has a dispersion of ''SX'' then a linear transformation ''Y'' = ''aX'' + ''b'' for real ''a'' and ''b'' should have dispersion ''SY'' = |''a''|''S''''X''. Other measures of dispersion are dimensionless. In other words, they have no units even if the variable itself has units. These include: * Coefficient of variation * Quartile coefficient of dispersion * Relative mean difference, equal to twice the Gini coefficient There are other measures of dispersion: * Variance (the square of the standard deviation) – location-invariant but not linear in scale. * Variance-to-mean ratio – mostly used for count data when the term coefficient of dispersion is used and when this ratio is dimensionless, as count data are themselves dimensionless, not otherwise. Some measures of dispersion have specialized purposes, among them the Allan variance and the Hadamard variance. For categorical variables, it is less common to measure dispersion by a single number; see qualitative variation. One measure that does so is the discrete entropy. 抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)』 ■ウィキペディアで「Statistical dispersion」の詳細全文を読む スポンサード リンク
|